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Abstract — In this paper, we characterize the decoding region 
of algebraic soft decoding (ASD) [4] of Reed-Solomon (RS) 
codes over erasure channels and binary symmetric channel 
(BSC). Optimal multiplicity assignment strategies (MAS) are 
investigated and tight bounds are derived to show the ASD can 
significantly outperform conventional Berlekamp Massey (BM) 
decoding over these channels for a wide code rate range. The 
analysis technique can also be extended to other channel models, 
e.g., RS coded modulation over erasure channels. 

I. Introduction 

Reed-Solomon (RS) codes are powerful error correction 
codes, which are widely employed in many state-of-the-art 
communication systems. However, in most of the existing 
systems, RS codes are decoded via algebraic hard decision 
decoding (HDD), which does not fully exploit the error 
correction capability of the code. 

Since the seminal works of [1] and [4], algebraic soft 
decoding (ASD) algorithms have come to research interest 
due to their significant performance gain over HDD. In [4], 
an asymptotically optimal multiplicity assignment strategy 
(MAS) has been proposed (it is asymptotically optimal in the 
sense that it maximizes the capacity [4] of ASD when the 
code length goes to infinity). However, the optimal MAS and 
corresponding performance analysis for finite length RS codes 
is difficult to obtain. Recent progress in this problem can be 
found in [2] and [5]. However, in these papers, the proposed 
schemes largely rely on numerical computation and give little 
insight into the decoding region of ASD. In [3], a general 
framework has been studied for channels with additive cost 
[3]. However, it is still interesting and of practical value to 
characterize the decoding regions for particular channels. 

In this paper, we first investigate the optimal MAS for 
ASD for binary erasure channel (BEC) and binary symmetric 
channel (BSC) and characterize the corresponding decoding 
region of ASD. It is shown that ASD has a significant gain 
over HDD for BEC and BSC. The analysis technique is then 
extended to some other discrete alphabet channels (DAC), 
e.g., RS coded modulation transmitted over erasure channels. 

'This work was supported in part by Seagate Research Inc., Pittsburgh, PA, 
USA. 



ASD is shown to consistently outperform conventional erasure 
decoding, which suggests the potential application of ASD to 
practical systems, such as DSL systems as described in [7]. 

II. Performance Analysis of ASD over BEC 

We first consider the case when RS codewords are trans- 
mitted as bits through a BEC with erasure probability e. 

In the multiplicity assignment state, since the true a poste- 
rior probability (APP) of each bit is not available, we take 
the reliability information observed from the channel, i.e, 
if one bit is erased in a symbol, we regard two candidate 
symbols as being equally probable. Furthermore, similar to 
[4] and [5], we assume that the codewords are uniform over 
F^. Consequently, there is no preference for some symbols 
over others and the multiplicity assigned to equally probable 
candidate symbols are assumed to be the same. 

We define each symbol that has a number of i bits erasures 
as being of type i. Consequently, for a code over GF(2 m ), 
there are (m+1) types of symbols. Let the number of symbols 
of type i in a received codeword be a;. As discussed above, 
we will assign equal multiplicity to symbols of the same type; 
whereas, the multiplicity assigned to type i may vary according 
to the received codeword. Define the multiplicity assigned to 
each candidate of symbol of type i as m*. Thus, the total 
multiplicity assigned to one symbol of type i is 2 l x m.;. 
According to the above notation, the score and cost can be 
defined as in [4] as follows: 
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The approximation in 10 becomes tight when becomes 
large. 

In general, the decoding radius of ASD is difficult to char- 
acterize due to its soft decoding nature. However, it is shown 
in [4] that the ASD is guaranteed to return the transmitted 
codeword when the following inequality holds: 



Lemma 1 The sufficient condition for ASD to list the trans- 
mitted codeword is as follows: 



S> y/2(K-l)C 



(3) 



Proof: This is proved in Corollary 5 in [4]. ■ 
This sufficient condition becomes a tight approximation 
when N is large. With a little bit abuse of notation, we 
approximate the decoding region of ASD using the region 
guaranteed by (0. Upper bound and lower bound are derived 
for this approximate decoding region, which facilitates the per- 
formance analysis in practical systems. Further, we consider 
ASD with infinite cost such that we can relax the multiplicity 
from integers to real numbers. It is justified by the fact that 
rational numbers are dense on the real axis and they can always 
be scaled up to be integers with infinite cost (see also [2]). 
We first investigate the optimal MAS for the BEC. 

Lemma 2 The proportional multiplicity assignment strategy 
(PMAS) is optimal for BEC regardless of the received signal. 

Proof: Assume that the received codeword has <ij sym- 
bols of type i. Since it can be readily shown that a larger 
cost will always lead to a better performance, the MAS can 
be formulated as maximizing the score with a cost constraint. 
With infinite cost, the problem is expressed as: 



max y < 

1 ' i=0 



a.m. 



subject to \ X)£Lo a * 2 ' m i < Co 
This is a standard optimization problem with linear cost 
function and quadratic constraint. Using a Lagrange multiplier, 
the new objective function becomes 



L = 



m ( \ 171 

i=0 \ i=0 
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(4) 



Take the partial derivative with respect to m; and set it to 
zero. We have: 

dL 



dm; 



-di + \2 % aivrii = 



(5) 



Therefore we have rrii = =^—, i.e., m t oc 2~\ which proves 
that PMAS is optimal. ■ 
Since PMAS is optimal, we will from now on assume that 
PMAS is used. Suppose for any received signal, it has 
type series as {a^}. Under PMAS, we assume that the total 
multiplicity for each symbol as M. Consequently, the score 
is So = Y^iLo Oi2~ l M = r\M and the cost is Co = 

iEr=o«. rtf2 = hM 2 , where r, = J2T=q^ 2 ^ is a 
positive number. The sufficient condition of becomes: 



So > V2(K - 1)C 
7] > K- 1 



(6) 
(7) 



In the following lemmas, we will first derive the worst case 
erasure pattern for ASD over BEC. 



Lemma 3 If a received codeword can be decoded, it can 
always be decoded if some of the erasures are recovered. 

Proof: The proof is immediate by the fact that if some 
of the erasures are recovered, the score will increase and the 
cost will decrease. Consequently, the sufficient condition (0) 
is still satisfied. ■ 

Lemma 4 Given i-bit erasures, the worst case erasure pattern 
for ASD under PMAS ( in terms of the guaranteed decoding 
region) is that all the bits are spread in different symbols as 
evenly as possible. That is: (N — i+ \_jf\N) symbols contain 
bit errors and (i — \_jj\N) contain bit errors. 

Proof: The lemma can be readily verified by induction. 
Take two arbitrary symbols of type i and j, if we average the 
bit erasures between these two, we get two symbols of type 
\J-^-\ and r^r-]- The updated rf can be expressed as: 

77' = r? + +2-^1 -2-*-2 -i <n (8) 

From (0, it is clear that when r] increases, the erasure pattern 
becomes better for ASD. Thus, since 77 > 77', the later erasure 
pattern is worse. By repeating the above procedure, we can 
finally get the worst erasure pattern for ASD under PMAS, i.e., 
the bit erasures are spread as evenly as possible in different 
symbols. ■ 
The asymptotic bit-level decoding radius e (i.e., the worst 
case erasure pattern) for ASD can be formulated as a standard 
optimization problem: 

mine = }^i=o le i 

s.t Bi > 0, i = 0, 1, • • • , m 

and Ya= iei = N,rj < K - 1 

The above problem can be solved numerically using integer 
linear programming. However, for practical medium to high 
rate RS codes, we can have a simpler form of the radius. 

Theorem 1 The bit-level decoding radius for ASD can be 

expressed as: e = 2(N - K + 1) for K > |iV + 1 and 
e = 3iV - i(K -l)for\N+l<K<\N + l. 

Proof: According to Lemma 0] the worst case erasure 
pattern is all erased bits are spread evenly over different 
symbols. Assume the erased bits e < 2N. Thus, the worst 
case erasure event consists of e\ symbols of type 1 and e2 
symbols of type 2. The problem can be formulated as: 

min e = e-y + 2e2 (9) 
s.t. ei > 0,e 2 > Q,ei + e 2 < N 



and rj = N - 



1 3 

-ei e 2 < K - 1 

2 4 



After some straightforward manipulation, we can obtain the 
optimal value e*, e 2 and e* . 

For K > + 1, it is guaranteed that in the worst case 
erasure pattern, each symbol contains no more than 1 bit 
erasure. The optimal value can be computed as: 



e* = 2(N- K+ l),e* 2 = 
e* = 2[N -{K- 1)] 



(10) 
(11) 



For |iV 



- 1 < K < TjN + 1, the optimal value can be 
expressed as: 

e* = 4(# - 1) - N, e* 2 = 2[N - 2{K - 1)] 
e* = 2{N -{K-l)]+N- 2(K 



1 



(12) 
(13) 



It is easy to verify that e* < 27V in this case. Also note that 
e* > 2[iV — (K — 1)], which suggests as the code rate goes 
lower, the bit-level decoding radius gets larger and larger. ■ 
TheorernHgives an FER upper bound on ASD performance 
under PMAS with infinite cost. 

Corollary 1 For codes of rate for R > | + -h, the sufficient 
condition cannot be satisfied when there are more than 
2{N — K + 1) symbols in error. 

Proof: The corollary follows from ( II 11 1 and Lemma [3] 
If there are more than 2{N — K + 1) symbols having erased 
bits, the most optimistic case is that these symbols are of type 
1. Besides, due to il Q , the sufficient condition is not satisfied 
and ASD can not guarantee to decode. ■ 
Corollary ^ can serve as a lower bound of the guaran- 
teed region of ASD. The bounds derived in Theorem ^ an d 
Corollary ^ are shown in Figure ^ m conjunction with the 
maximum likelihood (ML) performance union bound over 
RS averaged ensemble [6], ML performance of RS averaged 
ensemble is very close to the capacity of BEC, which suggests 
RS codes are good codes. Besides, ASD has a significant 
performance gain over conventional BM erasure decoding. 
This is intuitively true, since we do not have to erase the whole 
symbol if part of the bits in that symbol is erased, which can 
be taken advantage of by ASD. It can be seen from the figure 
that for practical high rate long codes, both the upper and 
lower bounds are tight and they together accurately indicate 
the performance of the guaranteed region of ASD down to 
FER = 10 -20 (this is not shown here due to the page limit). 
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However, we will show that symbol-level soft information can 
still be utilized by ASD once proper MAS is applied. 

Similar to the BEC case discussed in Section HJ we assume 
that equally probable candidate symbols are assigned equal 
multiplicity. We also follow the same notations except that type 
i is the type that i bits in that symbol get flipped. Similar to the 
BEC case, the problem can be reformulated as a constrained 
optimization problem: 

max S = 2^ i=0 a-iTUi 

C=§ o C) x mf < Co, 

However, unlike the BEC case, we do not know at 
the decoder. Consequently, the MAS should be the same for 
any received codeword. Here, we resort to the asymptotically 
optimal MAS, i.e., we try to decode up to the largest bit-level 
radius for the worst case error pattern. The problem can be 
viewed as a max-min problem over {c^} and {mi}. 



maxminN a^m.; 



(14) 



subject to a i = N where {a^js are integers 

and f ET=o (?) x < C . 

The above problem is quite complicated, since {a;}s are 
integers. Even if that condition is relaxed, the solution may 
only be obtained numerically, which does not give any insight 
into the exact decoding radius of ASD. 

We first take one step back and consider a special case of 
BSC, called 1-bit flipped BSC, i.e., in each symbol, at most 
one bit is in error. By doing that, we only have to assign 
multiplicities to two types of symbols. Suppose there are e 
symbols in error, e < N and the score and cost are S = 
(N —e)mo+emi and C — ^[rn^+mml]. The asymptotically 
optimal MAS is just to maximize e. Suppose mi = tmo, 
where the multiplicity coefficient Q < t < 1, the sufficient 
condition in (0 becomes: 



[(N - e) + et]m > y '{K - 1)N(1 + mt 2 )m (15) 
N - V 'N \K - 1)(1 + mt 2 ) 



e < 



1 - t 



(16) 



Let t = 1 in dl5> . we have the inequality independent of e. 
The inequality always hold as long as N > (K — 1)(1 + to), 
which is true for low rate codes, i.e., for a rate 



R < 



1 



1 



1 

N 



(17) 



Fig. 1. Bounds and Simulation Results of RS(255,239) over BEC 



III. Performance Analysis of ASD over BSC 

In this section, we study the performance of ASD over BSC. 
Since there is no bit-level soft information available, any bit- 
level reliability based soft decoding algorithm does not work. 



In this case, setting to = TOi will correct all errors in the 
received signal. On the other hand, for higher rate code, the 
optimal MAS is to optimize t to maximize the RHS of dl 61 . 

The problem is equivalent to finding a point on the hyper- 
bola N J { _ 1 - ) — rnx 2 = 1 within interval x E [0, 1) such that 
a line passing through that point and the given point (1,/V) 
has the maximal slope. This is nothing but the tangent to 



the hyperbola. For the tangential point (xo,yo)> we nave the 
following relationships: 

-r- \x=x = N{K - l)m— = (18) 

dx y 1 - £o 



»o 2 



— TUXq = 1 



(19) 



N(K- 1) 
We define A as: 

A = {m(K - l)) 2 + (N - K + l)(m 2 (K - 1) - mN) (20) 

Plug in yo into d!9i . we can get: 

-m(K- 1) + VA 



X 



2 (K - l)-mN 



(21) 



Using (tT8t and ( fT9t . the optimal value of the slope in terms 
of xq is: 



dy Nm 
d= dx~ lx=x ° = r 



m - 



Thus [d\ is the exact error correction radius of ASD al- 
gorithm under asymptotically optimal MAS over 1-bit flipped 
BSC. Besides, J2 lb can be further approximated for some long 
and high rate codes. Observe that (N — K + l)(m 2 (K — 1) — 
mN) <C (m(K — l)) 2 for high rate codes, using first order 
Taylor expansion, we get: 



nr , \ r 1 (N — K + l)(m 2 (K — 1) - mN) 1 



Plug this into (12 It and (I22> . we get: 

~_ N(N-K + 1) 
~ N + (K-1) 



(23) 



(24) 



Note that the approximation becomes tight when rate 
is high, however, it should be noted that the performance 
improvement is significant only when the rate is low. For 
instance, for N = 255, K — 223 ASD does not improve 
the performance, for N = 255, K = 167 ASD gives an extra 
error correction capability over GS decoding, for N — 255, 
K = 77, it corrects 7 more errors and for N — 255, K = 30, 
it corrects 45 more errors. For K < 30, all errors can be 
corrected for this 1-bit flipped BSC. 

Now, we show that the above MAS is also asymptotically 
optimal for RS codes over BSC for a wide code range. 

Lemma 5 Given a number of i < N bits errors and the 
optimal multiplicity coefficient t < i, the worst case error 
pattern for ASD algorithm is that all erroneous bits are spread 
in i different symbols. 

Proof: Assume that there are e bits flipped by the channel. 
The cost for BSC channel does not change when the MAS is 
fixed. The worst case error pattern is the one that minimizes 
the score. In the above MAS, multiplicities are assigned only 



to the received symbol and its 1-bit flipped neighbors. Thus, 
a potential error pattern which is worse than the 1-bit flipped 
BSC will try to group bits in each symbol to reduce the score. 
The original score can be expressed as: 

S = M[{N-e)+te] (25) 

Let the worst case error pattern has e' symbols containing 
1 bit error and e" symbols containing more than 1-bit error. 
Evidently, for symbols containing more than 2-bit errors, we 
can always further decrease the score by splitting these bit 
errors into one symbol containing 2-bit errors and the other 
containing the rest of the errors. Consequently, the worst case 
error pattern will contain symbols with at most 2-bit errors. 
We have e' + 2e" = e. The score becomes: 

S" = M[(N -e'- e") + te'} = M[(N ~e) + et + e"(l - 2t)] 

(26) 

When t < h, S" > S, which proves that spreading all the bits 



(22) in different symbols is the worst case error pattern. 



Theorem 2 For BSC, the asymptotic optimal MAS of ASD 
algorithm can guarantee to decode up to d bits, where d is 
computed in \22\ . given d < N and the optimal multiplicity 
coefficient t < |. 

Proof: According to lemma |5j all bits spread in dif- 
ferent symbols are the worst case error pattern for the ASD 
algorithm, which is nothing but the 1-bit flipped BSC. Thus, 
they will be the asymptotically dominating error patterns. On 
the other hand, it is shown before that the proposed MAS is 
asymptotically optimal for 1-bit flipped BSC, i.e., maximizing 
the asymptotic decoding radius d. Consequently, the proposed 
MAS will guarantee to decode all error patterns with fewer 
than d-bit errors over BSC as well. ■ 

The error correction radii as a function of t are given in 
Figure |2] It can be seen that the optimal MAS (which is 
achieved by t = 0.2) corrects 13 and 50 more bit errors than 
GS and BM asymptotically. Besides, we also plot bit-level 
radius of PMAS, where the x-axis is the crossover probability 
p c of the BSC. Note that PMAS is not asymptotically optimal 
for BSC. Even though we choose p c to maximize the bit-level 
radius (around p c = 0.13), the bit-level decoding radius is 
still 1 bit smaller than that of the optimal MAS. The reason 
can be explained as follows: the worst case error pattern of 
BSC is shown to be all bit-level errors spread in different 
symbols, thus, the asymptotically optimal MAS only has to 
assign multiplicities to symbols of type and type 1. On the 
other hand, KV algorithm assigns multiplicities proportionally. 
Thus it also assigns multiplicities to candidate symbols with 
more than 1-bit flip, which makes it suboptimal in terms of 
bit-level decoding radius. 

We consider the performance of this asymptotically optimal 
MAS in Figure |3l Consider AWGN channel with 1-bit quanti- 
zation before decoding, which is equivalent to BSC. The ASD 
under the proposed MAS outperforms GS decoding and HDD 
by 1.6dB and 3.5dB respectively at an FER = 10 -5 . 
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Fig. 2. Bit level decoding radius of an RS (255,55) code 
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Fig. 3. Performance Comparison for RS (15,3) code 



IV. Performance Analysis of ASD of RS Coded 
Modulation over Erasure Channels 

We extend the result from Section [H] to RS coded modu- 
lation transmitted over erasure channels, i.e. u bits of coded 
information are grouped together and transmitted using a 2"- 
ary QAM or PSK modulation format. The channel will erase 
the signal with erasure probability e. Practical channels of this 
model are discussed in [7]. 

In Section [H] we showed that PMAS is optimal for erasure 
channels. Clearly, all erasure patterns in this coded modulation 
model is a subset of the erasure patterns of BEC. Thus, PMAS 
is also optimal for this channel model. Let each PSK (or QAM) 
symbol contains u bits. Without loss of generality, we assume 
u divides to, i.e., to = lu. Thus, for each symbol, we have 
(I + 1) types. 

Lemma 6 The worst case erasure pattern for ASD under 
PMAS is that all erasure events are spread in different symbols 
as evenly as possible. 

Proof: Assume two RS symbols are of type i and j, we 
can average all the erasures events between the two symbols, 
we have: 



Similar to Lemma |4] when r\ decreases, spreading erasure 
events in different RS symbols evenly is the worst case. ■ 

Theorem 3 ASD under PMAS can guarantee to decode up to 
(N - K + 1)/(1 - 2 -u ) erasure events if R> 2~ u + i. 

Proof: According to Lemma [6] spreading erasure events 
in different symbols is the worst case erasure pattern if K > 

2~ u N+l. 

Assume that e symbols of type 1 are erased and the rest 
(N — e) symbols are of type 0. Thus ?/ — N — (1 — 2~ u )e. 
According to @ when the following condition is satisfied: 



e< (N- K +!)/(! -2- u ) 



(28) 



ASD is guaranteed to decode the transmitted codeword. ■ 
Note that this asymptotic decoding radius is consistently larger 
than the conventional RS erasure decoding region, e = N — K 
erasure events. Note that d28l is a generalization of ( II It in 
Theorem ^ (with u = 1 as special case). 

V. Conclusion 

We have presented optimal multiplicity assignment strate- 
gies and performance analysis of algebraic soft decision de- 
coding over erasure channels and BSC. It was shown that ASD 
under optimal MAS can significantly outperform the BM and 
GS algorithm. 
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